AITopics | importance matrix

b6a171867138c80de2a35a6125d6757c-Supplemental-Conference.pdf

Neural Information Processing Systems, Feb-16-2026, 17:34:12 GMT

artificial intelligence, eigenvalue, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.30)

Overleaf Example

Neural Information Processing Systems, Feb-16-2026, 17:34:08 GMT

However, there still exist many properties of contrastive learning that are not guaranteed.

artificial intelligence, contrastive learning, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Tōhoku > Iwate Prefecture > Morioka (0.04)
Asia > Japan > Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

A Proofs, A.1 Proof of Theorem 4.5

Neural Information Processing Systems, Oct-9-2025, 05:33:43 GMT

In the next step, we prove the uniqueness of the optimal solution. Then we construct the generalization guarantee of tri-contrastive learning. Following the proof of Theorem 4.5, we know that the optimal solutions learned by triCL are [...] With Lemma A.1, we have [...] Based on the proof of Theorem 4.5, we know that the [...] We first extend the augmentation graph to an asymmetric form. We adopt ResNet-18 as the backbone. For SCL, we randomly choose 20 dimensions.

artificial intelligence, eigenvalue, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.30)

Overleaf Example

Neural Information Processing Systems, Oct-9-2025, 05:33:40 GMT

However, there still exist many properties of contrastive learning that are not guaranteed.

artificial intelligence, contrastive learning, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Jordan (0.04)
Asia > Japan > Honshū > Tōhoku > Iwate Prefecture > Morioka (0.04)
Asia > Japan > Honshū > Chūbu > Nagano Prefecture > Nagano (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

English K_Quantization of LLMs Does Not Disproportionately Diminish Multilingual Performance

Borgersen, Karl Audun

arXiv.org Artificial Intelligence, Mar-10-2025

For consumer usage of locally deployed LLMs, the GGUF format and k_quantization are invaluable tools for maintaining the performance of the original model while reducing it to sizes deployable on consumer-grade hardware. The number of bits dedicated to each weight from the original model is reduced based on how important the weight is thought to be during model inference. This importance is arrived at through the application of an 'importance matrix', a relatively small text document meant to be representative of the LLM's standard use cases. In the vast majority of quants available online, this document is primarily written in English. It was therefore an open question whether performance on English language tasks was preserved through the sacrifice of multilingual performance and whether it can be preserved with alternate importance matrices. This article investigates these hypotheses by quantizing Llama3.3 70B on importance matrices written in three languages (English, Norwegian, and Malayalam) and evaluating them on the MixEval dataset in both English and Norwegian. All experiments related to k_quantization yielded non-significant results, indicating that current quantization practices do not disproportionately harm multilingual performance.
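The idea of spending precision where weights matter most can be illustrated with a toy sketch. It assumes importance is estimated as the mean squared activation of each input feature over a calibration set, and that a quantization scale is chosen to minimize the importance-weighted squared error. Both are assumptions made for illustration: the abstract does not describe the computation, and none of the function names below come from the paper or from any GGUF tooling.

```python
import random


def column_importance(activations):
    """Mean squared activation per input feature over the calibration samples."""
    n = len(activations)
    dim = len(activations[0])
    return [sum(row[j] ** 2 for row in activations) / n for j in range(dim)]


def quantize(weights, scale, bits):
    """Symmetric round-to-nearest quantization, returned in dequantized form."""
    qmax = 2 ** (bits - 1) - 1
    return [max(-qmax - 1, min(qmax, round(w / scale))) * scale for w in weights]


def weighted_error(weights, dequantized, importance):
    """Squared quantization error, with each weight scaled by its importance."""
    return sum(i * (w - d) ** 2 for w, d, i in zip(weights, dequantized, importance))


def best_scale(weights, importance, bits, candidates=64):
    """Grid-search the scale that minimizes the importance-weighted error.

    Assumes at least one weight is nonzero, so the base scale is positive.
    """
    qmax = 2 ** (bits - 1) - 1
    base = max(abs(w) for w in weights) / qmax
    # Candidate scales from 0.5x to 1.2x of the plain max-abs scale.
    grid = [base * (0.5 + 0.7 * k / (candidates - 1)) for k in range(candidates)]
    return min(
        grid,
        key=lambda s: weighted_error(weights, quantize(weights, s, bits), importance),
    )


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical calibration activations: feature 0 is far more active.
    acts = [[rng.gauss(0, 5.0 if j == 0 else 1.0) for j in range(8)] for _ in range(32)]
    row = [rng.gauss(0, 1) for _ in range(8)]
    imp = column_importance(acts)
    print("importance-aware scale:", best_scale(row, imp, bits=4))
    print("uniform-importance scale:", best_scale(row, [1.0] * 8, bits=4))
```

Under this sketch, a calibration text in a different language would change `column_importance`, and therefore which weights the chosen scale protects. That is the kind of effect the article tests for.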

dataset, importance matrix, llm, (13 more...)

arXiv.org Artificial Intelligence

2503.03592

Country:

Europe > Norway (0.15)
North America > United States > Kansas (0.05)
North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > Experimental Study > Negative Result (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Integrated feature analysis for deep learning interpretation and class activation maps

Li, Yanli, Hassanzadeh, Tahereh, Shamonin, Denis P., Reijnierse, Monique, van der Helm-van Mil, Annette H. M., Stoel, Berend C.

arXiv.org Artificial Intelligence, Jul-1-2024

Understanding the decisions of deep learning (DL) models is essential for the acceptance of DL in risk-sensitive applications. Although methods such as class activation maps (CAMs) give a glimpse into the black box, they miss some crucial information, which limits their interpretability, and they merely provide the considered locations of objects. To provide more insight into the models and the influence of datasets, we propose an integrated feature analysis method, which consists of feature distribution analysis and feature decomposition, to look closer into the intermediate features extracted by DL models. This integrated feature analysis could provide information on overfitting, confounders, outliers in datasets, model redundancies and principal features extracted by the models, and provide distribution information to form a common intensity scale, all of which are missing in current CAM algorithms. The integrated feature analysis was applied to eight different datasets for general validation: photographs of handwritten digits, two datasets of natural images and five medical datasets, including skin photography, ultrasound, CT, X-rays and MRIs. The method was evaluated by calculating the consistency between the CAMs' average class activation levels and the logits of the model. Based on the eight datasets, the correlation coefficients through our method were all very close to 100%, and based on the feature decomposition, 5%-25% of features could generate equally informative saliency maps and obtain the same model performances as using all features. This proves the reliability of the integrated feature analysis. As the proposed methods rely on very few assumptions, this is a step towards better model interpretation and a useful extension to existing CAM algorithms. Codes: https://github.com/YanliLi27/IFA
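The consistency check mentioned in this abstract can be pictured with a small sketch. It assumes "consistency" means the Pearson correlation between each sample's mean CAM activation and the model's logit for that sample. The abstract does not name the exact metric, so this choice, the function names, and the toy data are all assumptions, not the authors' code.

```python
import math


def mean_activation(cam):
    """Average value over all pixels of a 2D class activation map."""
    total = sum(sum(row) for row in cam)
    count = sum(len(row) for row in cam)
    return total / count


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences.

    Assumes neither sequence is constant, so both variances are nonzero.
    """
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / math.sqrt(vx * vy)


def cam_logit_consistency(cams, logits):
    """Correlate per-sample mean CAM activation with the matching logit."""
    return pearson([mean_activation(c) for c in cams], logits)


if __name__ == "__main__":
    # Hypothetical 2x2 CAMs and logits for three samples.
    cams = [[[1, 1], [1, 1]], [[2, 2], [2, 2]], [[3, 5], [3, 5]]]
    logits = [2.0, 4.0, 8.0]
    print(cam_logit_consistency(cams, logits))
```

A value near 1 would indicate that the maps' overall intensity tracks the model's output, which is how the abstract reads the reported correlation coefficients.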

cam, dataset, importance matrix, (13 more...)

arXiv.org Artificial Intelligence

2407.01142

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Netherlands > South Holland > Leiden (0.05)
Asia > China (0.04)

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area (0.95)
Health & Medicine > Diagnostic Medicine > Imaging (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Imputation-Free Learning from Incomplete Observations

Gao, Qitong, Wang, Dong, Amason, Joshua D., Yuan, Siyang, Tao, Chenyang, Henao, Ricardo, Hadziahmetovic, Majda, Carin, Lawrence, Pajic, Miroslav

arXiv.org Artificial Intelligence, Jul-5-2021

Although recent works have developed methods that can generate estimations (or imputations) of the missing entries in a dataset to facilitate downstream analysis, most depend on assumptions that may not align with real-world applications and could suffer from poor performance in subsequent tasks. This is particularly true if the data have large missingness rates or a small population. More importantly, the imputation error could be propagated into the prediction step that follows, causing the gradients used to train the prediction models to be biased. Consequently, in this work, we introduce the importance guided stochastic gradient descent (IGSGD) method to train multilayer perceptrons (MLPs) and long short-term memories (LSTMs) to directly perform inference from inputs containing missing values without imputation. Specifically, we employ reinforcement learning (RL) to adjust the gradients used to train the models via back-propagation. This not only reduces bias but allows the model to exploit the underlying information behind missingness patterns. We test the proposed approach on real-world time-series (i.e., MIMIC-III), tabular data obtained from an eye clinic, and a standard dataset (i.e., MNIST), where our imputation-free predictions outperform the traditional two-step imputation-based predictions using state-of-the-art imputation methods.

dataset, gradient, prediction, (14 more...)

arXiv.org Artificial Intelligence

2107.01983

Country: